首页> 外文OA文献 >Validation of Soft Classification Models using Partial Class Memberships: An Extended Concept of Sensitivity & Co. applied to the Grading of Astrocytoma Tissues
【2h】

Validation of Soft Classification Models using Partial Class Memberships: An Extended Concept of Sensitivity & Co. applied to the Grading of Astrocytoma Tissues

机译:使用部分类验证软分类模型   会员资格:适用于评分的敏感度和公司的扩展概念   星形细胞瘤组织

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We use partial class memberships in soft classification to model uncertainlabelling and mixtures of classes. Partial class memberships are not restrictedto predictions, but may also occur in reference labels (ground truth, goldstandard diagnosis) for training and validation data. Classifier performance is usually expressed as fractions of the confusionmatrix, such as sensitivity, specificity, negative and positive predictivevalues. We extend this concept to soft classification and discuss the bias andvariance properties of the extended performance measures. Ambiguity inreference labels translates to differences between best-case, expected andworst-case performance. We show a second set of measures comparing expected andideal performance which is closely related to regression performance, namelythe root mean squared error RMSE and the mean absolute error MAE. All calculations apply to classical crisp classification as well as to softclassification (partial class memberships and/or one-class classifiers). Theproposed performance measures allow to test classifiers with actual borderlinecases. In addition, hardening of e.g. posterior probabilities into class labelsis not necessary, avoiding the corresponding information loss and increase invariance. We implement the proposed performance measures in the R package"softclassval", which is available from CRAN and athttp://softclassval.r-forge.r-project.org. Our reasoning as well as the importance of partial memberships forchemometric classification is illustrated by a real-word application:astrocytoma brain tumor tissue grading (80 patients, 37000 spectra) for findingsurgical excision borders. As borderline cases are the actual target of theanalytical technique, samples which are diagnosed to be borderline cases mustbe included in the validation.
机译:我们在软分类中使用部分类别成员资格来对不确定标签和类别混合进行建模。部分班级成员资格不限于预测,还可以出现在用于培训和验证数据的参考标签(真实情况,金标准诊断)中。分类器性能通常表示为混淆矩阵的分数,例如敏感性,特异性,阴性和阳性预测值。我们将此概念扩展到软分类,并讨论扩展性能度量的偏差和方差性质。歧义索引标签可转换为最佳情况,预期情况和最坏情况之间的差异。我们展示了比较预期性能和理想性能的第二组度量,后者与回归性能密切相关,即均方根误差RMSE和均值绝对误差MAE。所有计算都适用于经典明快分类以及软分类(部分类成员资格和/或一类分类器)。拟议的绩效衡量标准允许使用实际的临界情况测试分类器。另外,硬化例如。不必将后验概率归类到类别标签中,从而避免了相应的信息丢失并增加了不变性。我们在R包“ softclassval”中实施了建议的性能指标,该包可从CRAN和http://softclassval.r-forge.r-project.org获得。我们的推理以及部分成员资格对于化学计量学分类的重要性通过一个真实的字词应用得以说明:星形胶质瘤脑肿瘤组织分级(80例患者,37000个频谱)用于发现手术切除边界。由于临界情况是分析技术的实际目标,因此在验证中必须包括被诊断为临界情况的样品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号